Robust Semi-Supervised Fake News Recognition by Effective Augmentations and Ensemble of Diverse Deep Learners
نویسندگان
چکیده
Nowadays, most people obtain information from social media networks, where news accompanied by photos and videos attracts readers more than traditional ones. However, these advantages are often misused some publishers to disseminate fake rapidly, thereby adversely affecting individuals societies. Thus, the early detection of posts is crucial. Developing an automatic content-based detector ideal way overcome this issue. Given that generation rate in drastic labeling a huge amount data required fully supervised models expensive time consuming, not beneficial real applications. To address limitation, study presents semi-supervised method utilizing ensemble diverse deep learners, effective augmentations, distribution-aware pseudo-labeling technique. Here, proposed hybrid loss function enforces learners have accurate classification performance while attending different parts content. Moreover, augmentations enhance robustness prevent overfitting effectively. Diverse utilized annotate unlabeled accurately update their parameters confident predicted curriculum way, enhancing quality pseudo labels model. we utilize encoded sentences pre-trained transformer models, such as XLNET, parameter sharing build light on common feature extractor module. Consequently, number less existing methods, experiments conducted three public datasets reveal consistently outperforms state-of-the-art with proportions labeled across all evaluated datasets.
منابع مشابه
Semi-Supervised Ensemble Ranking
Ranking plays a central role in many Web search and information retrieval applications. Ensemble ranking, sometimes called meta-search, aims to improve the retrieval performance by combining the outputs from multiple ranking algorithms. Many ensemble ranking approaches employ supervised learning techniques to learn appropriate weights for combining multiple rankers. The main shortcoming with th...
متن کاملDeep Web Search Interface Identification: A Semi-Supervised Ensemble Approach
To surface the Deep Web, one crucial task is to predict whether a given web page has a search interface (searchable HyperText Markup Language (HTML) form) or not. Previous studies have focused on supervised classification with labeled examples. However, labeled data are scarce, hard to get and requires tedious manual work, while unlabeled HTML forms are abundant and easy to obtain. In this rese...
متن کاملSemi-supervised Clustering Ensemble by Voting
— Clustering ensemble is one of the most recent advances in unsupervised learning. It aims to combine the clustering results obtained using different algorithms or from different runs of the same clustering algorithm for the same data set, this is accomplished using on a consensus function, the efficiency and accuracy of this method has been proven in many works in literature. In the first part...
متن کاملFake News Detection using Stacked Ensemble of Classifiers
Fake news has become a hotly debated topic in journalism. In this paper, we present our entry to the 2017 Fake News Challenge which models the detection of fake news as a stance classification task that finished in 11th place on the leader board. Our entry is an ensemble system of classifiers developed by students in the context of their coursework. We show how we used the stacking ensemble met...
متن کاملSemi-supervised deep learning by metric embedding
Deep networks are successfully used as classification models yielding state-ofthe-art results when trained on a large number of labeled samples. These models, however, are usually much less suited for semi-supervised problems because of their tendency to overfit easily when trained on small amounts of data. In this work we will explore a new training objective that is targeting a semi-supervise...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3278323